612 Introduced RECAP Search Alerts sweep index #4127

albertisfu · 2024-06-21T01:23:57Z

This PR introduces the Sweep index approach (discussed in #612) to send RECAP Search Alerts that might have been missed by the percolator approach during the day.

The cl_send_recap_alerts command will perform the following tasks:

Remove the RECAPSweepDocument index from the previous day and recreate it to get a clean index (I'll create a follow-up issue to apply the logic to store the last 7-14 days indices for debugging purposes).
Index all documents added/changed during the day from the main RECAP index to the RECAPSweepDocument. The indexing process uses the ES re-index API with a custom query for efficiency. The documents included in the re-index are:
- Dockets added/modified during the day.
- All RECAPDocuments from dockets added/modified during the day.
- Independent RECAPDocuments added/modified during the day.
- Dockets that are parents of RECAPDocuments added/modified independently during the day.
This ensures that every document that should be included in the day's alerts is indexed.
Considering that the re-index process can take considerable time depending on the number of documents, it is scheduled as an ES task. This task retrieves an ID to monitor its progress. Initially, the process will wait for one minute after the task is scheduled. Depending on the task's progress, the estimated waiting time will dynamically change before checking the task status again, repeating the process until the task is completed. If after 10 failed tries of getting the task status (possible due to a ES cluster overhead), the process is aborted and a error is logged so we can take an action manually.
Some variables are stored in Redis to make the command resumable in case of a failure or if the Pod dies:
- alert_sweep:re_index_completed: If the command fails after the re-index process is completed, this step can be skipped when the command is resumed.
- alert_sweep:query_date: This stores the date from which we are sending alerts. The idea is that the command can be started during the day we are sending alerts, and in case it is close to 00:00 of the next day, it will still know that the alerts belong to the previous day in the event of a failure occurring after midnight.
- alert_sweep:task_id: In case of failure during the ES re-index process, monitoring can continue from the previously scheduled task instead of starting a new one.
Send RT and Daily alerts. After the re-index process is completed, Real-time and Daily alerts are sent. The process is the same for both rates:
- Get alerts for each user.
- Filter out RT alerts for non-members.
- Search each query alert against the RECAPSweepDocument and retrieve the hits.
- Each query is limited to retrieving up to SCHEDULED_ALERT_HITS_LIMIT hits, defaulting to 20 (the current max number of hits in ES OA Search Alerts).
- Hits are processed to ensure only the correct hits are included and no hits/alerts are duplicated:
  - According to the alert query and the hits returned, the alert can be classified within three groups:
    - Docket-only alert: Queries that don't include any child filter or child field in the text query using advanced search syntax. If the hit contains child documents (matched by a docket field), child documents are filtered, ensuring no child field was highlighted. This helps differentiate whether an alert is Docket-only.
    - RECAP-only query: Alerts that can only match RECAPDocuments, such as if only child filters are used or if the text query only matches a child field like description or plain_text.
    - Cross-object query: Alerts that match both Docket and RECAPDocument fields, such as a combination of a case_name and document_number filter, or a text query matching case_name and plain_text simultaneously.
  The reason to differentiate alert types is to avoid sending alerts incorrectly based on matched content.
  
  In practice, we only need to differentiate Docket-only alerts from RECAP-only or Cross-object alerts. If a hit in an alert doesn't include RD fields in the query or filters and the hit doesn't match RD highlights (Docket-only), we need to ensure the matched Docket was added/updated during the day. This ensures the alert should be triggered and no child hits are included. To confirm that, the Docket date_modified must belong to the same day, indicating the Docket was added or modified that day. We want to avoid cases where a Docket is indexed due to one of its RECAPDocuments being updated independently.
  
  For RECAP-only and Cross-object alerts, RECAPDocuments are matched as inner hits. The filtering process confirms that the query contains a child field as a filter or within the text query using advanced syntax, or if a child field is highlighted. If true, the child hit is included in the alert.
An additional filter checks if the Docket hit or the RECAPDocument hit has already triggered the same alert. We keep two sets per alert:
- alert_hits:id.d stores Dockets that have triggered an alert.
- alert_hits:id.r stores RECAPDocuments that have triggered the alert.
For Docket-only alerts, we check if the Docket hit ID is already within alert_hits:id.d. If so, the hit is excluded from the alert. For RECAP-only or Cross-object alerts, we check if the RD hits are within alert_hits:id.r. Only RDs not in the set are included in the alert. If all RD hits have previously triggered the alert, the hit is omitted from the alert.
Finally, after filtering hits and child hits, alert emails are sent, along with their related webhooks.
WLY and MLY rates:
- The process is similar to that described for RT and DLY rates for filtering hits. However, emails are not sent immediately. Instead, they are stored as ScheduledAlertHit to be sent according to their rate by the cl_send_scheduled_alerts command.
- WLY and MLY webhooks are sent immediately, similarly to how WLY and MLY ES OA alerts are triggered by the percolator.
The Alerts UI is enabled for RECAP Search behind a waffle flag:

Here are some examples of alert emails:

Docket-only alert: Only the Docket is included in the alert with no child hits.
RECAPDocument-only alert: The docket fields are shown with the RECAPDocument nested below the docket fields.
Cross-object alert: Here we can see how multiple cases are includes in the alert. In this case, the cross-object alert matched a hit by its case_name and also matched a RD belonging to the case that included the keywords within the document description. The second hit only matched the case by its case_name with no RDs matched.

Also you can notice the View Additional Results for this case is shown in the first case.

This is because the original search matched more RECAPDocuments due to the case_name being indexed into each RD, which is the behavior in the frontend.

In the alert, we filter out the RDs that actually matched the alert.
One question here: should we keep the View Additional Results button as in the frontend, or only show it if there are still 5 RDs matched after filtering? It's important to note that when clicking that button in the frontend, more results can be shown since it includes RDs filtered from the alert.

Notes and additional questions:

Highlighted fields are the same as in the frontend.
Let me know if any other Docket or RECAPDocument fields should be included in the alerts.
What would be the change in the subject or content of the RT alerts missed by the percolator that are sent by the sweep index?
Should we trigger the alert if it’s a cross-object alert that matches both a Docket and RDs that have been updated durint the same day but all the RDs matched have already triggered that alert? Should we send the alert only including the Docket, or omit it? Considering it is a cross-object alert, it is expected to see both the Docket and RDs in the alert.

cl/lib/elasticsearch_utils.py

- Test filter out queries and hits based on fields that matched.

for more information, see https://pre-commit.ci

cl/alerts/management/commands/cl_send_recap_alerts.py

… index command.

- Added tests to assert nested child documents in case alerts.

semgrep-app · 2024-06-28T02:24:24Z

cl/alerts/templates/alert_email_es.html

+ {% endif %}
+ {% if doc.plain_text %}
+ {% contains_highlights doc.plain_text.0 True as highlighted %}
+ <span style="display: block; margin-top: 5px;">{% if highlighted %}&hellip; {% endif %}{{ doc.plain_text|render_string_or_list|safe|underscore_to_space }} &hellip;</span>


Detected a segment of a Flask template where autoescaping is explicitly disabled with '| safe' filter. This allows rendering of raw HTML in this segment. Ensure no user data is rendered here, otherwise this is a cross-site scripting (XSS) vulnerability.
_{Ignore this finding from template-unescaped-with-safe.}

semgrep-app · 2024-06-28T02:24:25Z

cl/alerts/templates/alert_email_es.html

+ <a href="https://www.courtlistener.com{% if doc.absolute_url %}{{ doc.absolute_url }}{% else %}{{ result.docket_absolute_url }}#minute-entry-{{ doc.docket_entry_id }}{% endif %}" class="visitable">{% if doc.short_description %}{{ doc.short_description|render_string_or_list|safe }}<span class="gray">&nbsp;&mdash;&nbsp;</span>{% endif %}Document #{% if doc.document_number %}{{ doc.document_number }}{% endif %}{% if doc.attachment_number %}, Attachment #{{ doc.attachment_number }}{% endif %}
+ </a>
+ {% if doc.description %}
+ <span style="display: block; margin-top: 5px;">Description: {{ doc.description|render_string_or_list|safe }}</span>


Detected a segment of a Flask template where autoescaping is explicitly disabled with '| safe' filter. This allows rendering of raw HTML in this segment. Ensure no user data is rendered here, otherwise this is a cross-site scripting (XSS) vulnerability.
_{Ignore this finding from template-unescaped-with-safe.}

semgrep-app · 2024-06-28T02:24:25Z

cl/alerts/templates/alert_email_es.html

+ {% for doc in result.child_docs %}
+ {% with doc=doc|get_attrdict:"_source" %}
+ <li>
+ <a href="https://www.courtlistener.com{% if doc.absolute_url %}{{ doc.absolute_url }}{% else %}{{ result.docket_absolute_url }}#minute-entry-{{ doc.docket_entry_id }}{% endif %}" class="visitable">{% if doc.short_description %}{{ doc.short_description|render_string_or_list|safe }}<span class="gray">&nbsp;&mdash;&nbsp;</span>{% endif %}Document #{% if doc.document_number %}{{ doc.document_number }}{% endif %}{% if doc.attachment_number %}, Attachment #{{ doc.attachment_number }}{% endif %}


Detected a segment of a Flask template where autoescaping is explicitly disabled with '| safe' filter. This allows rendering of raw HTML in this segment. Ensure no user data is rendered here, otherwise this is a cross-site scripting (XSS) vulnerability.
_{Ignore this finding from template-unescaped-with-safe.}

semgrep-app · 2024-06-28T02:24:26Z

cl/alerts/templates/alert_email_es.html

 {{ forloop.counter }}. {{ result|get_highlight:"caseName"|safe }}
- ({% if result.court_id != 'scotus' %}{{ result|get_highlight:"court_citation_string"|nbsp|safe }}&nbsp;{% endif %}{% if type == 'o' %}{{ result.dateFiled|date:"Y" }}{% elif type == 'oa' %}{{ result.dateArgued|date:"Y" }}{% endif %})
+ ({% if result.court_id != 'scotus' %}{{ result|get_highlight:"court_citation_string"|nbsp|safe }}&nbsp;{% endif %}{% if type == 'o' %}{{ result.dateFiled|date:"Y" }}{% elif type == 'oa' %}{{ result.dateArgued|date:"Y" }}{% elif type == 'r' %}{{ result.dateFiled|date:"Y" }}{% endif %})


Detected a segment of a Flask template where autoescaping is explicitly disabled with '| safe' filter. This allows rendering of raw HTML in this segment. Ensure no user data is rendered here, otherwise this is a cross-site scripting (XSS) vulnerability.
_{Ignore this finding from template-unescaped-with-safe.}

semgrep-app · 2024-06-28T02:24:28Z

Semgrep found 1 template-unescaped-with-safe finding:

cl/alerts/templates/alert_email_es.html
- L107 - Triage

Detected a segment of a Flask template where autoescaping is explicitly disabled with '| safe' filter. This allows rendering of raw HTML in this segment. Ensure no user data is rendered here, otherwise this is a cross-site scripting (XSS) vulnerability.

_{Ignore this finding from template-unescaped-with-safe.}

…ing the Re Index API

- Enabled RECAP Search alerts UI behind a waffle. - Added alert frequency estimation for RECAP

semgrep-app · 2024-07-02T16:21:19Z

Semgrep found 6 baseclass-attribute-override findings:

cl/search/documents.py
- L1832 - Triage
- L1832 - Triage
- L1832 - Triage
- L1832 - Triage
- L1832 - Triage
- L1832 - Triage

Class RECAPSweepDocument inherits from both DocketDocument and ESRECAPDocument which both have a method named prepare_trustee_str; one of these methods will be overwritten.

_{Ignore this finding from baseclass-attribute-override.}

…ets + RD hits - Fixed RECAP MLY and WLY scheduled alerts content.

cl/lib/elasticsearch_utils.py

…dex. - Ensure document timestamps get updated on partial updates.

- Fixed email templates - Refactored retrieve_task_info

…g extra Docket-only and RD-only queries.

semgrep-app · 2024-07-10T02:21:07Z

cl/lib/elasticsearch_utils.py

+ child_search = child_search.extra(
+ from_=0,
+ size=settings.SCHEDULED_ALERT_HITS_LIMIT
+ * settings.RECAP_CHILD_HITS_PER_RESULT,
+ )


QuerySet.extra' does not provide safeguards against SQL injection and requires very careful use. SQL injection can lead to critical data being stolen by attackers. Instead of using '.extra', use the Django ORM and parameterized queries such as People.objects.get(name='Bob').
_{Ignore this finding from avoid-query-set-extra.}

semgrep-app · 2024-07-10T02:21:08Z

cl/lib/elasticsearch_utils.py

+ parent_search = parent_search.extra(
+ from_=0, size=settings.SCHEDULED_ALERT_HITS_LIMIT
+ )


QuerySet.extra' does not provide safeguards against SQL injection and requires very careful use. SQL injection can lead to critical data being stolen by attackers. Instead of using '.extra', use the Django ORM and parameterized queries such as People.objects.get(name='Bob').
_{Ignore this finding from avoid-query-set-extra.}

- Fixed issues and improved command resumability

albertisfu · 2024-07-11T01:20:03Z

I’ve added the solution to the issue I found while working on the Percolator approach.

The problem was as follows:

The previous approach to determine whether an alert was a Docket-only query or a RECAPDocument-only/Cross-object alert query relied on two checks:

There was a method called query_includes_rd_field which checked if a RECAPDocument field was included either in a filter or within the text query as a fielded search.

If no RD fields were found in the query, it was classified as a possible "Docket-only" query.

An additional step was performed to confirm if the alert was Docket-only by checking if the Hit contained nested RECAPDocuments and if any of the RD fields were HL.

The theory behind this was that if an alert contained an RD field as a filter or a fielded text query, like:

case_name: "lorem ipsum"
document_number: 1

or:

q: document_number:1 docket_number:"23-2345"

It would always match a Docket with at least one RECAPDocument. Therefore, these types of queries couldn’t be considered Docket-only queries, which is true for filters since they're always combined internally with an AND.

However, I realized that queries like the following:

q: document_number:1 OR docket_id:22345

can match either only empty Dockets or Dockets + RDs. For instance:
https://www.courtlistener.com/?q=docket_id%3A40661867%20OR%20document_number%3A1&type=r&order_by=dateFiled%20desc

So, these kinds of queries can't be considered as RECAPDocument-only alerts or cross-object alerts since they partially behave as Docket-only queries.

This issue can only occur with simple text queries that could match either a Docket or a RECAPDocument field, for instance:

q: "United states"

https://www.courtlistener.com/?q=%22United+States%22&type=r&order_by=score+desc&

where it can match either Dockets with no documents or Dockets with RECAPDocuments due to containing the search terms within one of their searchable fields.

So, the method of using query_includes_rd_field to detect whether an alert is Docket-only or not was not reliable.

The problems with not correctly classifying Docket-only hits versus RECAPDocument-only or Cross-object hits were two:

When a hit is classified as Docket-only, additional filters are performed to determine if the hit should be included in the alert. The first check is if the Docket hit has already triggered that alert in the past; if so, it's omitted. The second filter is related to the date the docket was modified. Since Dockets can be indexed into the sweep index due to any of their child documents being added or modified during the day, Docket-only alerts can be matched even if the docket didn't change during the day. To solve this issue, a check confirms if the docket was modified during the same day; if so, the alert is triggered. If an alert is not properly classified as Docket-only, these filters could be omitted, leading to triggered alerts when they shouldn't be.
The second issue was that query_includes_rd_field acted at the alert level. For instance, if it indicated the alert contained RD fields, this assumption applied to all the hits in the alert. This is incorrect, as we can see in previous examples where there can be hits that only matched a Docket with no RDs or hits that matched a docket and RDs. Doing this at the alert level could have led to miss RDs into cross-object hits that were incorrectly classified.

The solution.

Solution applied was to perform two additional queries alongside the main query: one that can only match Dockets (Docket-only query) and one that can only match RECAPDocuments (RECAPDocument-only query).

The additional Docket-only query is performed against the main sweep index but only targets Docket documents, and the results in this query only retrieve the docket_id, which is the only field needed to filter results returned by the main query.

The second additional query, the RECAP-only one, is performed against a different new index called recap_document_sweep. It was necessary to create a new index because we need to index RECAPDocuments including exclusively RECAPDocument fields (compared to the main index where RECAPDocuments also included Docket fields indexed). So, when running the query, we can know which RDs were matched by the RECAPDocument-only query and consider them as RECAP-only hits.

The re-index process also changed. In addition to the main re-index, an additional re-index is performed, which copies only RECAPDocuments indexed or modified during the day to the recap_document_sweep index, considering only RECAPDocument fields.

During the alert query process, three queries are sent to ES in a single request: one for the main query, one for the Docket-only query, and one for the RECAPDocument-only query. The latter two are faster since they're not join queries and only retrieve the docket_id and the document id in the results source.

The results of these two additional queries are used as follows:

For each docket hit returned by the main query, it checks if the docket hit ID is also contained within the docket-only query results. If so, it's a possible docket-only hit. Then, as a second step, RECAPDocuments matched are filtered using the results returned by the RECAPDocument-only query. Each RECAPDocument hit ID is checked if it’s included within the RECAPDocument-only query results; if so, it should be included within the results.

If, after performing the previous filters, no RDs remain in the docket hit, it's considered a Docket-only hit, and no RECAPDocuments are nested within the hit.

Also, if a docket returned by the main query is not found in the docket-only query results, it's directly classified as a RECAPDocument-only hit or a cross-object hit, and the additional corresponding filters are applied.

So following this new approach solved the issue, and now the sweep index approach is more reliable and will trigger alerts similarly to the Percolator approach.

semgrep-app · 2024-07-19T15:33:58Z

cl/search/documents.py

@@ -1826,3 +1831,24 @@ def prepare_non_participating_judge_ids(self, instance):

 def prepare_cluster_child(self, instance):
 return "opinion_cluster"
+
+
+class RECAPSweepDocument(DocketDocument, ESRECAPDocument):


Class RECAPSweepDocument inherits from both DocketDocument and ESRECAPDocument which both have a method named $F; one of these methods will be overwritten.
_{Ignore this finding from baseclass-attribute-override.}

…due to scheduled task - This can be removed after tasks in the queue have been processed.

…CAP_CHILD_HITS_PER_RESULT value

semgrep-app · 2024-09-26T23:00:17Z

cl/lib/elasticsearch_utils.py

+ child_total_query = child_docs_count_query.extra(
+ size=0, track_total_hits=True
+ )


QuerySet.extra' does not provide safeguards against SQL injection and requires very careful use. SQL injection can lead to critical data being stolen by attackers. Instead of using '.extra', use the Django ORM and parameterized queries such as People.objects.get(name='Bob').
_{Ignore this finding from avoid-query-set-extra.}

semgrep-app · 2024-09-26T23:00:18Z

cl/lib/elasticsearch_utils.py

+ main_doc_count_query = main_doc_count_query.extra(
+ size=0, track_total_hits=True
+ )


QuerySet.extra' does not provide safeguards against SQL injection and requires very careful use. SQL injection can lead to critical data being stolen by attackers. Instead of using '.extra', use the Django ORM and parameterized queries such as People.objects.get(name='Bob').
_{Ignore this finding from avoid-query-set-extra.}

…ies returning values change

…n HL to filter out RD hits

albertisfu · 2024-09-27T17:12:14Z

Just confirming that the RECAP Search Alerts UI will be controlled by recap-alerts-active waffle flag.

ERosendo · 2024-10-08T14:27:42Z

cl/custom_filters/templatetags/extras.py

+ request = context["request"]
+ return (
+ search_type == SEARCH_TYPES.OPINION
+ or search_type == SEARCH_TYPES.ORAL_ARGUMENT
+ or (
+ search_type == SEARCH_TYPES.RECAP
+ and waffle.flag_is_active(request, "recap-alerts-active")
+ )
+ )


I believe we can refactor this code to make it more readable.

Suggested change

request = context["request"]

return (

search_type == SEARCH_TYPES.OPINION

or search_type == SEARCH_TYPES.ORAL_ARGUMENT

or (

search_type == SEARCH_TYPES.RECAP

and waffle.flag_is_active(request, "recap-alerts-active")

)

)

request = context["request"]

if search_type == SEARCH_TYPES.RECAP:

return waffle.flag_is_active(request, "recap-alerts-active")

return search_type in (SEARCH_TYPES.OPINION, SEARCH_TYPES.ORAL_ARGUMENT)

ERosendo · 2024-10-08T14:30:32Z

cl/alerts/utils.py

+from cl.lib.types import CleanData
+from cl.search.constants import (
+ ALERTS_HL_TAG,
+ SEARCH_RECAP_CHILD_HL_FIELDS,
+ recap_document_filters,
+ recap_document_indexed_fields,
+)


We're not using any of these elements. Let's remove them to clean up the imports.

ERosendo · 2024-10-08T14:46:01Z

cl/search/constants.py

+recap_document_indexed_fields = [
+ "id",
+ "docket_entry_id",
+ "description",
+ "entry_number",
+ "entry_date_filed",
+ "short_description",
+ "document_type",
+ "document_number",
+ "pacer_doc_id",
+ "plain_text",
+ "attachment_number",
+ "is_available",
+ "page_count",
+ "cites",
+]
+
+recap_document_filters = [
+ "available_only",
+ "description",
+ "document_number",
+ "attachment_number",
+]


These arrays are unused. They're likely remnants of an older approach. We can remove them.

ERosendo · 2024-10-08T16:53:24Z

cl/lib/elasticsearch_utils.py

@@ -1073,22 +1076,25 @@ def build_es_base_query(
 cd: CleanData,
 child_highlighting: bool = True,
 api_version: Literal["v3", "v4"] | None = None,
-) -> tuple[Search, QueryString | None]:
+ alerts: bool = False,
+) -> tuple[Search, QueryString | None, QueryString | None]:


I think we should use a dataclass instead of a tuple as the return type for this helper method. During my initial review of the PR, I consistently assumed the parent query was the second element in the tuple. I only realized the correct order (child query, parent query) after referring to the docstring 😅

Also, the addition of a new element to the tuple has increased the number of _ variables used to discard unwanted values.

albertisfu added 2 commits June 20, 2024 19:22

fix(elasticsearch): Test RECAP nested index reliability

3e4f269

Merge branch 'main' into 612-introduced-recap-search-alerts

53b3b65

semgrep-app bot reviewed Jun 21, 2024

View reviewed changes

cl/lib/elasticsearch_utils.py Outdated Show resolved Hide resolved

albertisfu mentioned this pull request Jun 21, 2024

Add "Create an Alert" for RECAP searches #612

Open

albertisfu and others added 3 commits June 21, 2024 20:35

fix(alerts): Changed sweep index approach to parent-child documents

2955b0b

fix(alerts): Added cl_send_recap_alerts command

9307b77

- Test filter out queries and hits based on fields that matched.

[pre-commit.ci] auto fixes from pre-commit.com hooks

9b4e1c1

for more information, see https://pre-commit.ci

semgrep-app bot reviewed Jun 25, 2024

View reviewed changes

cl/alerts/management/commands/cl_send_recap_alerts.py Outdated Show resolved Hide resolved

semgrep-app bot reviewed Jun 25, 2024

View reviewed changes

cl/alerts/management/commands/cl_send_recap_alerts.py Outdated Show resolved Hide resolved

albertisfu added 3 commits June 26, 2024 20:51

fix(alerts): Implemented filtering of RECAP alerts hits for the sweep…

8b537f0

… index command.

fix(alerts): Updated ES alert email templates to support RECAP Alerts.

c1232ec

- Added tests to assert nested child documents in case alerts.

Merge branch 'main' into 612-introduced-recap-search-alerts

3e96f61

semgrep-app bot reviewed Jun 28, 2024

View reviewed changes

albertisfu added 5 commits June 28, 2024 16:43

fix(alerts): Group alerts and case hits limit

51c7bb6

fix(alerts): Trigger RECAP search alerts webhooks

7fc3298

fix(alerts): Schedule wly and mly RECAP Search Alerts

b5016ba

fix(alerts): Copy documents from the main index to the sweep index us…

4a128bf

…ing the Re Index API

fix(alerts): Fixed RECAPSweepDocument index mapping

3a4a456

- Enabled RECAP Search alerts UI behind a waffle. - Added alert frequency estimation for RECAP

albertisfu added 2 commits July 2, 2024 19:41

fix(alerts): Tweak RECAP Alert estimation query to consider both Dock…

add980a

…ets + RD hits - Fixed RECAP MLY and WLY scheduled alerts content.

Merge branch 'main' into 612-introduced-recap-search-alerts

a20113f

semgrep-app bot reviewed Jul 3, 2024

View reviewed changes

cl/lib/elasticsearch_utils.py Show resolved Hide resolved

semgrep-app bot reviewed Jul 3, 2024

View reviewed changes

cl/lib/elasticsearch_utils.py Show resolved Hide resolved

albertisfu added 3 commits July 2, 2024 19:55

fix(elasticsearch): Fixed build_daterange_query type hint

ebf269d

fix(alerts): Fixed re_index task estimated remaining time compute

bffee6d

fix(alerts): Handle creation and removal of the RECAP alerts sweep in…

847f0fd

…dex. - Ensure document timestamps get updated on partial updates.

albertisfu added 2 commits July 3, 2024 19:27

fix(alerts): Fix should_docket_hit_be_included date comparison

0d63080

- Fixed email templates - Refactored retrieve_task_info

Merge branch 'main' into 612-introduced-recap-search-alerts

5b3d130

albertisfu requested a review from mlissner July 4, 2024 03:43

albertisfu marked this pull request as ready for review July 4, 2024 03:44

albertisfu mentioned this pull request Jul 4, 2024

Keep up to 7/14 days versions of the RECAP alerts sweep index #4168

Open

albertisfu added 2 commits July 9, 2024 20:17

fix(alerts): Changed approach to filter out cross-object hits by usin…

5077e01

…g extra Docket-only and RD-only queries.

Merge branch 'main' into 612-introduced-recap-search-alerts

9dffbfd

semgrep-app bot reviewed Jul 10, 2024

View reviewed changes

albertisfu added 2 commits July 10, 2024 16:43

fix(alerts): Added more tests related to filtering cross-object hits.

a4e4e62

- Fixed issues and improved command resumability

Merge branch 'main' into 612-introduced-recap-search-alerts

49dd480

Merge branch 'main' into 612-introduced-recap-search-alerts

a468336

semgrep-app bot reviewed Jul 19, 2024

View reviewed changes

albertisfu added 3 commits July 25, 2024 14:35

Merge branch 'main' into 612-introduced-recap-search-alerts

38d6884

fix(alerts): Restore send_es_search_alert_webhook to avoid conflicts …

b56f235

…due to scheduled task - This can be removed after tasks in the queue have been processed.

fix(alerts): Fixed MLY alerts test can't be sent after the 28th

d102664

albertisfu force-pushed the 612-introduced-recap-search-alerts branch from fc5720e to d102664 Compare July 29, 2024 16:09

albertisfu added 2 commits September 26, 2024 16:42

Merge branch 'main' into 612-introduced-recap-search-alerts

7977b80

fix(alerts): Fixed merge conflicts and adjust test accordingly new RE…

57b6df7

…CAP_CHILD_HITS_PER_RESULT value

semgrep-app bot reviewed Sep 26, 2024

View reviewed changes

albertisfu added 3 commits September 26, 2024 18:36

fix(elasticsearch): Fixed failing test due to build_full_join_es_quer…

b35ef0a

…ies returning values change

fix(alerts): Removed recap_document_hl_matched as we no longer rely o…

8902aa0

…n HL to filter out RD hits

Merge branch 'main' into 612-introduced-recap-search-alerts

d0b1298

ERosendo reviewed Oct 8, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

612 Introduced RECAP Search Alerts sweep index #4127

612 Introduced RECAP Search Alerts sweep index #4127

albertisfu commented Jun 21, 2024 •

edited

Loading

semgrep-app bot Jun 28, 2024

semgrep-app bot Jun 28, 2024

semgrep-app bot Jun 28, 2024

semgrep-app bot Jun 28, 2024

semgrep-app bot commented Jun 28, 2024

semgrep-app bot commented Jul 2, 2024

semgrep-app bot Jul 10, 2024

semgrep-app bot Jul 10, 2024

albertisfu commented Jul 11, 2024

semgrep-app bot Jul 19, 2024

semgrep-app bot Sep 26, 2024

semgrep-app bot Sep 26, 2024

albertisfu commented Sep 27, 2024

ERosendo Oct 8, 2024

ERosendo Oct 8, 2024

ERosendo Oct 8, 2024

ERosendo Oct 8, 2024

612 Introduced RECAP Search Alerts sweep index #4127

Are you sure you want to change the base?

612 Introduced RECAP Search Alerts sweep index #4127

Conversation

albertisfu commented Jun 21, 2024 • edited Loading

semgrep-app bot Jun 28, 2024

Choose a reason for hiding this comment

semgrep-app bot Jun 28, 2024

Choose a reason for hiding this comment

semgrep-app bot Jun 28, 2024

Choose a reason for hiding this comment

semgrep-app bot Jun 28, 2024

Choose a reason for hiding this comment

semgrep-app bot commented Jun 28, 2024

semgrep-app bot commented Jul 2, 2024

semgrep-app bot Jul 10, 2024

Choose a reason for hiding this comment

semgrep-app bot Jul 10, 2024

Choose a reason for hiding this comment

albertisfu commented Jul 11, 2024

The solution.

semgrep-app bot Jul 19, 2024

Choose a reason for hiding this comment

semgrep-app bot Sep 26, 2024

Choose a reason for hiding this comment

semgrep-app bot Sep 26, 2024

Choose a reason for hiding this comment

albertisfu commented Sep 27, 2024

ERosendo Oct 8, 2024

Choose a reason for hiding this comment

ERosendo Oct 8, 2024

Choose a reason for hiding this comment

ERosendo Oct 8, 2024

Choose a reason for hiding this comment

ERosendo Oct 8, 2024

Choose a reason for hiding this comment

albertisfu commented Jun 21, 2024 •

edited

Loading